Egocentric Pose Recognition in Four Lines of Code
Abstract
We tackle the problem of estimating the 3D pose of an individual’s upper limbs (arms+hands) from a chest-mounted depth camera. Importantly, we consider pose estimation during everyday interactions with objects. Past work shows that strong pose+viewpoint priors and depth-based features are crucial for robust performance. In egocentric views, hands and arms are observable within a well-defined volume in front of the camera. We call this volume an egocentric workspace. A notable property is that hand appearance correlates with workspace location. To exploit this correlation, we classify arm+hand configurations in a global egocentric coordinate frame, rather than a local scanning window. This greatly simplifies the architecture and improves performance. We propose an efficient pipeline which 1) generates synthetic workspace exemplars for training using a virtual chest-mounted camera whose intrinsic parameters match our physical camera, 2) computes perspective-aware depth features on this entire volume, and 3) recognizes discrete arm+hand pose classes through a sparse multiclass SVM. Our method provides state-of-the-art hand pose recognition performance from egocentric RGB-D images in real time.
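The test-time part of the pipeline (steps 2 and 3) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `workspace_features` and `classify`, the grid-averaging feature, and the `z_near`/`z_far` workspace limits are assumptions made here for illustration. Averaging depth over image-plane cells is only one plausible reading of "perspective-aware" features (each cell corresponds to a frustum-shaped bin in 3D), and the weights are assumed to come from an already trained sparse linear multiclass SVM.

```python
from typing import List, Sequence


def workspace_features(depth: Sequence[Sequence[float]], rows: int, cols: int,
                       z_near: float, z_far: float) -> List[float]:
    """Average normalized depth over a rows x cols grid covering the whole image.

    Assumption: pixels with no reading (z <= 0) or beyond z_far are treated
    as lying on the far plane of the workspace. The image must have at least
    `rows` rows and `cols` columns so that no grid cell is empty.
    """
    h, w = len(depth), len(depth[0])
    feats = []
    for r in range(rows):
        for c in range(cols):
            r0, r1 = r * h // rows, (r + 1) * h // rows
            c0, c1 = c * w // cols, (c + 1) * w // cols
            total, count = 0.0, 0
            for y in range(r0, r1):
                for x in range(c0, c1):
                    z = depth[y][x]
                    if z <= 0 or z > z_far:
                        z = z_far          # missing or out-of-workspace pixel
                    z = max(z, z_near)     # clamp to the near plane
                    total += (z - z_near) / (z_far - z_near)
                    count += 1
            feats.append(total / count)
    return feats


def classify(feats: Sequence[float], weights: Sequence[Sequence[float]],
             biases: Sequence[float]) -> int:
    """Return the index of the pose class with the highest linear score.

    `weights` holds one (ideally sparse) weight vector per pose class,
    assumed to be learned offline from synthetic exemplars.
    """
    scores = [sum(wi * f for wi, f in zip(w, feats)) + b
              for w, b in zip(weights, biases)]
    return max(range(len(scores)), key=scores.__getitem__)
```

Because the classifier scores the whole workspace at once, no scanning window is needed: one feature pass and one argmax over linear scores give the pose class.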
Related papers
Egocentric Activity Recognition Using Bag of Visual Words
This paper presents an approach for recognizing activities using video from the egocentric setup. Instead of using intermediate steps such as object detection or pose estimation, the approach models the spatial distribution of visual words. The interactions are encoded using a Histogram oriented Pairwise Relation (HOPR) between the visual words, orientations and alignments. A...
Trajectory aligned features for first person action recognition
Egocentric videos are characterised by their ability to have the first person view. With the popularity of Google Glass and GoPro, use of egocentric videos is on the rise. Recognizing action of the wearer from egocentric videos is an important problem. Unstructured movement of the camera due to natural head motion of the wearer causes sharp changes in the visual field of the egocentric camera c...
3D Hand Pose Detection in Egocentric RGB-D Images
We focus on the task of everyday hand pose estimation from egocentric viewpoints. For this task, we show that depth sensors are particularly informative for extracting near-field interactions of the camera wearer with his/her environment. Despite the recent advances in full-body pose estimation using Kinect-like sensors, reliable monocular hand pose estimation in RGB-D images is still an unsolv...
3D Face Recognition using Patch Geodesic Derivative Pattern
In this paper, a novel Patch Geodesic Derivative Pattern (PGDP) describing the texture map of a face through its shape data is proposed. Geodesic adjusted textures are encoded into derivative patterns for similarity measurement between two 3D images with different pose and expression variations. An extensive experimental investigation is conducted using the publicly available Bosphorus and BU-3...
Model Based Object Pose in Lines of Code
In this paper we describe a method for finding the pose of an object from a single image. We assume that we can detect and match in the image four or more noncoplanar feature points of the object, and that we know their relative geometry on the object. The method combines two algorithms; the first algorithm, POS (Pose from Orthography and Scaling), approximates the perspective projection with a scaled orth...
Journal: CoRR
Volume: abs/1412.0060
Year of publication: 2014